DETAILED ACTION
Claim Rejections - 35 USC § 103 

1. The following is a quotation of 35 U.S.C. 103(a) which forms the basis for all
obviousness rejections set forth in this Office action:

(a) A patent may not be obtained though the invention is not identically disclosed or described as set
forth in section 102 of this title, if the differences between the subject matter sought to be patented and 
the prior art are such that the subject matter as a whole would have been obvious at the time the 
invention was made to a person having ordinary skill in the art to which said subject matter pertains. 
Patentability shall not be negatived by the manner in which the invention was made. 

2. Claims 1-2, 4-6, 8-9, and 11-12 are rejected under 35 U.S.C. 103(a) as being
unpatentable over Huang et al. (US Patent No. 6593936) in view of Coorman et al. (US 
Patent No. 6665641). 

3. Regarding claim 1, Huang et al. disclose a method for upgrading a data stream
of multimedia data, said data stream comprising features with textual description, said
textual description is interpreted and synthesized into spoken voice (col. 10, lines 1-67).
Huang et al. fail to specifically disclose the method comprising including a set of 
phonetic translation hints in the data stream in addition to the textual description, and 
wherein said phonetic translation hints specify the phonetic transcription of parts or 
words of the textual description. 

However, Coorman et al. teach the method comprising including a set of
phonetic translation hints in the data stream in addition to the textual description, and
wherein said phonetic translation hints specify the phonetic transcription of parts or
words of the textual description (col. 9, lines 1-56 and also referring to the XPT section
on col. 10, lines 20-55).

Since Huang et al. and Coorman et al. are analogous art because they are from
the same field of endeavors, it would have been obvious to one of ordinary skill in the
art at the time of invention to modify Huang et al. by incorporating the teaching of
Coorman et al. in order to enhance speech quality.

4. Regarding claim 2, Huang et al. fail to disclose the method according to claim 1,
wherein each of said phonetic translation hints is followed by a word and said phonetic
transcription of said word. However, Coorman et al. further teach that each of said
phonetic translation hints is followed by a word and said phonetic transcription of said
word (col. 9, lines 19-25).

Since Huang et al. and Coorman et al. are analogous art because they are from 
the same field of endeavors, it would have been obvious to one of ordinary skill in the 
art at the time of invention to modify Huang et al. by incorporating the teaching of 
Coorman et al. in order to enhance speech quality. 

5. Regarding claims 4-5, the modified Huang et al. further disclose the method
according to claim 1, wherein said phonetic translation hints are embedded in a data
stream associated with textual type descriptors (col. 5, lines 11-67), and wherein said
MPEG data stream is an MPEG-7 data stream (col. 5, lines 11-67).

6. Regarding claim 6, Huang et al. fail to specifically disclose the method according
to claim 1, further comprising referring to an alphabet in a predetermined format for
representation of phonetic transcription information. However, Coorman et al. further
teach the step of referring to an alphabet in a predetermined format for representation of
phonetic transcription information (col. 9, lines 1-25).

Since Huang et al. and Coorman et al. are analogous art because they are from 
the same field of endeavors, it would have been obvious to one of ordinary skill in the 
art at the time of invention to modify Huang et al. by incorporating the teaching of 
Coorman et al. in order to determine/select appropriate phonetic data to represent the text.

7. Regarding claim 8, Huang et al. fail to disclose the method according to claim 1,
wherein said phonetic translation hints include a limited number of phonemes.
However, Coorman et al. further teach that the phonetic translation hints include a
limited number of phonemes (col. 10, lines 1-17).

Since Huang et al. and Coorman et al. are analogous art because they are from 
the same field of endeavors, it would have been obvious to one of ordinary skill in the 
art at the time of invention to modify Huang et al. by incorporating the teaching of 
Coorman et al. in order to reduce processing time.

8. Regarding claim 9, the modified Huang et al. fail to disclose the method
according to claim 8, wherein said phonemes are represented with a binary fixed length
or variable length code. However, since the system in both Huang et al. and Coorman
et al. is digital, phonemes must be represented in binary with either a fixed length or a
variable length code. The advantage of this is to facilitate storage and recovery of data
in a digital system.
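
By way of illustration only, a fixed length binary code for a limited phoneme set might be sketched as follows in Python. The phoneme inventory and the function names are hypothetical and are not taken from Huang et al. or Coorman et al.; the sketch merely shows one way a digital system could store and recover phonemes as binary codes of equal width.

```python
import math

# Hypothetical phoneme inventory, chosen for illustration only.
PHONEMES = ["p", "b", "t", "d", "k", "g", "a", "i"]

def fixed_length_table(phonemes):
    """Assign every phoneme a binary code of the same width."""
    width = max(1, math.ceil(math.log2(len(phonemes))))
    return {ph: format(i, f"0{width}b") for i, ph in enumerate(phonemes)}

def encode(seq, table):
    """Concatenate the codes of a phoneme sequence into one bit string."""
    return "".join(table[ph] for ph in seq)

def decode(bits, table):
    """Split the bit string into equal-width chunks and look each one up."""
    width = len(next(iter(table.values())))
    inverse = {code: ph for ph, code in table.items()}
    return [inverse[bits[i:i + width]] for i in range(0, len(bits), width)]

table = fixed_length_table(PHONEMES)
bits = encode(["t", "a", "k"], table)
```

With eight phonemes each code is three bits wide, so a decoder can recover the sequence without any delimiter between codes.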

9. Regarding claim 11, the modified Huang et al. disclose the method according to
claim 1, further comprising storing said phonetic translation hints in a text-to-speech
system to better identify corresponding elements of the textual description (referring to
claim 1), but fail to disclose the step of storing said phonetic translation hints in a text-to-
speech system to better identify corresponding elements of the textual description.
However, it would have been obvious to one of ordinary skill in the art at the time of
invention to include this step in a speech recognition system in order to reduce the load
on the speech recognizer.

10. Regarding claim 12, Huang et al. further disclose the method according to claim
11, wherein the phonetic translation hints together with the corresponding elements of
the textual description are implemented in text-to-speech interfaces, in which said
textual description is used in combination with phonetic information for search or filtering
of information (figure 9).

11. Claim 3 is rejected under 35 U.S.C. 103(a) as being unpatentable over Huang et
al. (US Patent No. 6593936) in view of Coorman et al. (US Patent No. 6665641), as
applied to claims 1 and 2, and further in view of Carter et al. (US Patent No. 6600814).

12. Regarding claim 3, the modified Huang et al. fail to disclose a method according
to claim 1 or 2, wherein each of said phonetic translation hints with said phonetic
transcription is valid for at least a portion of said textual description without requiring
repetition of said phonetic transcription for each occurrence of a word, for which the
phonetic transcription is given, in said textual description. However, Carter et al. teach
that each of said phonetic translation hints with said phonetic transcription is valid for at
least a portion of said textual description without requiring repetition of said phonetic
transcription for each occurrence of a word, for which the phonetic transcription is given,
in said textual description (col. 4, lines 1-23).

Since the modified Huang et al. and Carter et al. are analogous art because they 
are from the same field of endeavors, it would have been obvious to one of ordinary skill 
in the art at the time of invention to further modify Huang et al. by incorporating the 
teaching of Carter et al. in order to reduce the load on the text-to-speech synthesizer. 

13. Claims 7 and 10 are rejected under 35 U.S.C. 103(a) as being unpatentable over
Huang et al. (US Patent No. 6593936) in view of Coorman et al. (US Patent No.
6665641), as applied to claim 1, and further in view of Sharman (US Patent No.
5682501).

14. Regarding claim 7, the modified Huang et al. fail to disclose the method
according to claim 6, wherein said alphabet is an international phonetic alphabet or
SAMPA. However, Sharman teaches that the alphabet is an international phonetic
alphabet (col. 8, lines 1-9).

Since the modified Huang et al. and Sharman are analogous art because they 
are from the same field of endeavors, it would have been obvious to one of ordinary skill 
in the art at the time of invention to further modify Huang et al. by incorporating the 
teaching of Sharman in order to synthesize speech. 

15. Regarding claim 10, the modified Huang et al. fail to disclose the method 
according to claim 9, wherein coding of said phonemes takes into account statistics of 
the phonemes. However, Sharman further teaches that coding of said phonemes takes 
into account statistics of the phonemes (col. 8, lines 28-47).

Since the modified Huang et al. and Sharman are analogous art because they 
are from the same field of endeavors, it would have been obvious to one of ordinary skill 
in the art at the time of invention to further modify Huang et al. by incorporating the 
teaching of Sharman in order to enhance accuracy in determining phonemes. 
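
By way of illustration only, a variable length phoneme code that takes phoneme statistics into account might be sketched as follows in Python. Huffman coding is used here as an assumed example of such a technique; it is not asserted to be the method of Sharman or of the claimed invention, and the sample phoneme counts and the function name are hypothetical.

```python
import heapq
from collections import Counter

def huffman_table(phoneme_seq):
    """Build a prefix code in which frequent phonemes get shorter codes."""
    counts = Counter(phoneme_seq)
    if not counts:
        return {}
    if len(counts) == 1:
        return {next(iter(counts)): "0"}
    # Heap entries: (count, unique tiebreak, {phoneme: code built so far}).
    heap = [(n, i, {ph: ""}) for i, (ph, n) in enumerate(counts.items())]
    heapq.heapify(heap)
    tiebreak = len(heap)
    while len(heap) > 1:
        n1, _, t1 = heapq.heappop(heap)  # least frequent subtree
        n2, _, t2 = heapq.heappop(heap)  # next least frequent subtree
        merged = {ph: "0" + code for ph, code in t1.items()}
        merged.update({ph: "1" + code for ph, code in t2.items()})
        heapq.heappush(heap, (n1 + n2, tiebreak, merged))
        tiebreak += 1
    return heap[0][2]

# Hypothetical statistics: "a" is most frequent, "g" is least frequent.
sample = ["a"] * 8 + ["t"] * 4 + ["k"] * 2 + ["g"] * 1
codes = huffman_table(sample)
```

In this sample the most frequent phoneme receives a one-bit code and the two rarest receive three-bit codes, so the average code length falls below that of a two-bit fixed length code.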

Conclusion 

Any inquiry concerning this communication or earlier communications from the
examiner should be directed to Huyen Vo whose telephone number is 703-305-8665.
The examiner can normally be reached on M-F, 9-5:30.

If attempts to reach the examiner by telephone are unsuccessful, the examiner's
supervisor, Doris To, can be reached on 703-305-4827. The fax phone number for the
organization where this application or proceeding is assigned is 703-872-9306.

Information regarding the status of an application may be obtained from the 
Patent Application Information Retrieval (PAIR) system. Status information for 
published applications may be obtained from either Private PAIR or Public PAIR. 
Status information for unpublished applications is available through Private PAIR only. 
For more information about the PAIR system, see http://pair-direct.uspto.gov. Should
you have questions on access to the Private PAIR system, contact the Electronic 
Business Center (EBC) at 866-217-9197 (toll-free). 

Examiner Huyen X. Vo December 15, 2004